翻訳と辞書 |
Most frequent k characters : ウィキペディア英語版 | Most frequent k characters
In information theory, MostFreqKDistance is a string metric technique for quickly estimating how similar two ordered sets or strings are. The scheme was invented by ,〔 and initially used in text mining applications like author recognition. Method is originally based on a hashing function MaxFreqKChars classical author recognition problem and idea first came out while studying on data stream mining. Algorithm is suitable for coding in most of the programming languages like Java, Tcl, Python or J. 〔 〕 ==Definition== Method has two steps. * Hash input strings str1 and str2 separately using MostFreqKHashing and output hstr1 and hstr2 respectively * Calculate string distance (or string similarity coefficient) of two hash outputs, hstr1 and hstr2 and output an integer value
抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「Most frequent k characters」の詳細全文を読む
スポンサード リンク
翻訳と辞書 : 翻訳のためのインターネットリソース |
Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.
|
|